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A METHOD AND APPARATUS FOR DOMAIN TRANSFORMATION MULTIPLE 

SIGNAL PROCESSING 

Field of the Invention 

The invention relates generally to network communications. More particularly, the 
invention relates to a method and apparatus for domain transformation multiple signal 
processing. 

Background of the Invention 

High-speed networks are continually evolving. The evolution includes a continuing 
advancement in the operational speed of the networks. The network implementation of choice 
that has emerged is Ethernet networks physically connected over unshielded twisted pair wiring. 
Ethernet in its 10BASE-T form is one of the most prevalent high speed LANs (local area 
network) for providing connectivity between personal computers, workstations and servers. 

High-speed LAN technologies include 100BASE-T (Fast Ethernet) and 1000BASE-T 
(Gigabit Ethernet). Fast Ethernet technology has provided a smooth evolution from 10 Megabits 
per second (Mbps) performance of 10BASE-T to the 100 Mbps performance of 100BASE-T. 
Gigabit Ethernet provides 1 Gigabit per second (Gbps) bandwidth with essentially the simplicity 
of Ethernet. There is a desire to increase operating performance of Ethernet to even greater data 
rates. 

Figure 1 shows a block diagram of an Ethernet transceiver pair communicating over a bi- 
directional transmission channel, according to the prior art. The transceiver pair includes a first 
transceiver 100 and a second transceiver 105. The first transceiver 100 includes a transmitter 
section 110 that receives digital data for transmission over a transmission channel 135. The first 
transceiver 100 also includes a receiver section 120 that receives data. 
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The transceiver includes a digital to analog converter (DAC) for transmission, and an 
analog to digital converter (ADC) for reception. The hybrid circuit 130 is designed to reduce the 
level the transmit signal present in the receive signal path. The transmitter section 1 10 and the 
receiver section 120 are connected to a common twisted pair causing some of the transmission 
signals of the transmitter section 1 10 to be coupled into the receive signals of the receiver section 
120. The coupled signal can be referred to as an "echo" signal. 

The hybrid circuit 140 of the second transceiver 105 operates in the same manner as the 
hybrid circuit 130 of the first transceiver 100. The transmitter section 150 and the receiver 
section 160 of the second transceiver 105 operate in the same manner as the transmitter section 
1 10 and receiver section 120 of the first transceiver 100. 

An implementation of high speed Ethernet networks includes simultaneous, full 
bandwidth transmission, in both directions (termed full duplex), within a selected frequency 
band. When configured to transmit in full duplex mode, Ethernet line cards are generally 
required to have transmitter and receiver sections of an Ethernet transceiver connected to each 
other in a parallel configuration to allow both the transmitter and receiver sections to be 
connected to the same twisted wiring pair for each of four pairs. 

Figure 2 shows several Ethernet twisted pair LAN connections 212, 214, 216, 218 in 
parallel. The first connection 212 is between a first transmitter 1 15a (SI A) and first receiver 
125a (Rl A), and a second transmitter 1 15b (SIB) and a second receiver 125b (RIB). The 
second connection 214 is between a third transmitter 135a (S2A) and third receiver 145a (R2A), 
and a fourth transmitter 135b (S2B) and a fourth receiver 145b (R2B). The third connection 216 
is between a fifth transmitter 155a (S3 A) and fifth receiver 165a (R3A), and a sixth transmitter 
155b (S3B) and a sixth receiver 165b (R3B). The fourth connection 218 is between a seventh 
transmitter 175a (S4A) and seventh receiver 185a (R4A), and an eighth transmitter 175b (S4B) 
and an eighth receiver 185b (R4B). 
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The twisted pair LAN connections 212, 214, 216, 218 are located physically proximate, 
and interference between the twisted pairs 212, 214, 216, 218 is caused by interactions between 
signals of the twisted pair LAN connections 212, 214, 216, 218. The interference is in the form 
of far end cross-talk (FEXT) and near-end cross-talk (NEXT). NEXT is caused by interference 
due to signals generated at the near-end of a neighboring twisted pair connection. For example, 
NEXT interference includes the transmitter signals SI A, S3A, S4A of transmitters 115a, 155a, 
175a interfering with receiver signal R2A of receiver 145a. FEXT is caused by interference due 
to signals generated at the far-end of a neighboring twisted pair connection. For example, FEXT 
interference includes the transmitter signals SIB, S3B, S4B of transmitters 115b, 155b, 175b 
interfering with receiver signal R2A of receiver 145a. Other interference includes the echo 
signal. For example, the echo signal includes interference the signal S2A of transmitter 135a 
interfering with the receiver signal R2A of receiver 145a. Additional interference includes inter- 
symbol interference (ISI). ISI is self-interference of the transmit signal S2B at the input R2A of 
the receiver 145a. Other interference can include alien signal interference. Alien signal 
interference generally includes interference due to other Ethernet twisted pair LAN connections 
of cables that may be proximate to the twisted pair cable of the signal of interest. 

Present Ethernet technology can include time domain processing of digital signal streams 
for minimization of signal interference. As the data frequencies of the digital signal streams 
increases, the electronic hardware required to implement the time domain processing increases 
dramatically. 

Digital filtering is generally used to reduce the signal interference of Ethernet signals. 
Digital communications systems use filtering for many functions. These functions include 
adjacent and co-channel interference rejection, equalization, echo canceling and cross-talk 
canceling. Finite impulse response (FIR) filtering can be utilized to reduce signal interference. 

FIR filtering can require complex circuit implementations. For example, if an FIR filter 
has a length P (samples), P multiply and accumulate (MAC) operations are required per filtered 
output signal. High performance communication systems (this generally refers to high 
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throughput systems) the length of the FIR filters can be much greater. The electronic circuitry 
required to implement high performance FIR filters can become very large, requiring greater cost 
and higher power dissipation. High performance filters can require lengths (P) of 50-1000 taps 
in which each tap operates on a sampled signal delayed by one (or fraction of one) symbol period 
from the previous tap. Additionally, high performance systems can require several filters. 

A Gigabit Ethernet system can require echo, NEXT and FEXT cancellation and 
equalization. Additionally, Ethernet systems generally include 4 adjacent twisted pair 
connections per communication link, requiring NEXT and FEXT cancellation for each of the 
pairs. The twisted pairs of a communication link can additionally alien NEXT cancellation due 
to interference received from other twisted pair communication links. 

It is desirable to have an apparatus and method for a high throughput transceiver that 
provides for pre-processing and post-processing of digital signal streams for minimization of 
interference of Ethernet LAN signals. The processing should require a minimal amount of 
electronic hardware, and dissipate a minimal amount of power. Alternatively, the processing 
should enable higher data transmission rates, allow for longer transmission channels using 
comparable hardware and power dissipation. 
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Summary of the Invention 

The invention includes an apparatus and method for post-processing and pre-processing 
of digital signal streams for minimization of interference (including self-interference, ISI and 
cross-talk interference) of Ethernet LAN signals. The processing can be implemented with a 
minimal amount of electronic hardware, dissipate a minimal amount of power, and provide better 
performance. 

An embodiment of the invention includes an Ethernet transceiver. The Ethernet 
transceiver includes a plurality of digital signal streams, at least one digital signal stream is 
coupled to another of the digital signal streams. A transform block transforms a plurality of the 
digital signal streams from an original domain into a domain that allows lower complexity 
processing. A processor for joint processing the transformed digital signal streams, each joint 
processed digital signal stream being influenced by other digital signal streams. An inverse 
transform block inverse transforms the joint processed signal streams back to the original 
domain. 

Another embodiment of the invention includes a method of joint processing a plurality of 
digital signal streams. A first act of the method includes transforming a plurality of the digital 
signal streams from an original domain into a domain that allows lower complexity processing. 
A second act of the method includes joint processing of the transformed digital signal streams, 
each joint processed digital signal stream being influenced by characteristics of other digital 
signal streams. A third act includes inverse transforming the joint processed signal streams back 
to the original domain. 

Other aspects and advantages of the present invention will become apparent from the 
following detailed description, taken in conjunction with the accompanying drawings, illustrating 
by way of example the principles of the invention. 

Brief Description of the Drawings 
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Figure 1 shows a block diagram of a transceiver pair communicating over a bi-directional 
transmission channel, according to the prior art. 

Figure 2 shows a plurality of transceiver pairs located adjacently, and suffering from 
cross-talk coupling between signal streams of the transceiver pairs, according to the prior 
art. 

Figure 3 shows an Ethernet transceiver, according to an embodiment of the invention. 

Figure 4 shows an Ethernet receiver, according to an embodiment of the invention. 

Figure 5 shows greater detail of a portion of an Ethernet receiver, according to an 
embodiment of the invention. 

Figure 6 shows an Ethernet transmitter, according to an embodiment of the invention. 

Figure 7 shows greater detail of an Ethernet transmitter that includes near-channel 
processing 

Figure 8 shows greater detail of another Ethernet transmitter that includes far-channel 
processing. 

Figure 9 shows acts of a method of joint processing a plurality of digital signal streams, 
according to an embodiment of the invention. 

Detailed Description 

As shown in the drawings for purposes of illustration, the invention is embodied in an 
apparatus and method for a high throughput transceiver that includes signal processing for 
minimizing interference (self-interference, ISI and cross-talk interference) between parallel 
signals, and minimizes the effects of echo signals. 

Figure 3 shows several Ethernet twisted pair LAN connections 312, 314, 316, 318 in 
parallel, according to an embodiment of the invention. This embodiment includes joint 
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processors 301, 391 which include joint processing of signals transmitted and received over the 
twisted pair LAN connections 312, 314, 316, 318. The joint processing reduces the effects of 
interference and echo signals, on signals transmitted and received over the twisted pair LAN 
connections 312, 314, 316, 318. 

The first connection 312 is between a first transmitter 315a (S1A) and first receiver 325a 
(R1A), and a second transmitter 315b (SIB) and a second receiver 325b (RIB). The second 
connection 314 is between a third transmitter 335a (S2A) and third receiver 345a (R2A), and a 
fourth transmitter 335b (S2B) and a fourth receiver 345b (R2B). The third connection 316 is 
between a fifth transmitter 355a (S3A) and fifth receiver 365a (R3A), and a sixth transmitter 
355b (S3B) and a sixth receiver 365b (R3B). The fourth connection 318 is between a seventh 
transmitter 375a (S4A) and seventh receiver 385a (R4A), and an eighth transmitter 375b (S4B) 
and an eighth receiver 385b (R4B). 

The transmission signals SI A, SIB, S2A, S2B, S3A, S3B, S4A, S4B include digital 
signal streams. Due to the close proximity of the transmission signals SI A, SIB, S2A, S2B, 
S3A, S3B, S4A, S4B and Rl A, RIB, R2A, R2B, R3A, R3B, R4A, R4B the digital signal 
streams are coupled, causing both far-end cross-talk (FEXT) and near-end cross-talk (NEXT) 
interference within the digital signal streams. Additionally, echo signals interfere with each of 
the digital signal streams. 

Coupling of signals includes any signal stream appearing within another signal stream. 
The coupling can be due to a direct electrical connection, or due to inductive or capacitive 
coupling of the signal streams. Ethernet channels can include all of these types of coupled 
signals. 

NEXT is caused by interference due to signals generated at the near-end of a neighboring 
twisted pair connection. For example, NEXT interference includes the transmitter signals SI A, 
S3 A, S4A of transmitters 315a, 355a, 375a interfering with receiver signal R2A of receiver 345a. 
FEXT is caused by interference due to signals generated at the far-end of a neighboring twisted 
pair connection. For example, FEXT interference includes the transmitter signals SIB, S3B, 
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S4B of transmitters 315b, 355b, 375b interfering with receiver signal R2A of receiver 345a. 
Other interference includes the echo signal. For example, the echo signal includes interference 
the signal S2A of transmitter 335a interfering with the receiver signal R2A of receiver 345a. 
Additional interference includes inter-symbol interference (ISI). ISI is self-interference of the 
transmit signal S2B at the input R2A of the receiver 345a. Other interference can include alien 
signal interference. Alien signal interference generally includes interference due to other 
Ethernet twisted pair LAN connections. 

Figure 4 shows an Ethernet receiver, according to an embodiment of the invention. The 
embodiment of Figure 4 is a receiver that includes an analog front end 410, a transform section 
420, a joint processing section 430, and a reverse transform section 440. The hybrid circuit front 
end 410 receives a plurality (here, there are four) of transmission signals Rl, R2, R3, R4. The 
received transmission signals Rl, R2, R3, R4 can represent either of the earlier described 
received signals Rl A, R2A, R3A, R4A or RIB, R2B, R3B, R4B. 

The transform section 420 transforms each of the digital signal streams from an original 
domain into a lower complexity domain. An implementation of the transform includes a discrete 
Fourier transform (DFT) that transforms the digital signal streams from the time domain to the 
new domain. An efficient discrete Fourier transform is a fast Fourier transform (FFT). 
However, other examples of possible transforms include a discrete cosine transform, a discrete 
wavelength transform, a discrete Hartley transform and multi-rate filter transforms. The general 
premise is that the transform provides a different domain in which processing of the digital 
signal streams can be implemented with less complex electronic circuitry (for example, less 
multiplies and accumulates, slower clocks, etc.). 

The joint processing section 430 joint processes the transformed digital signal streams. 
Each joint processed digital signal stream is influenced by each of the other digital signal 
streams. Various embodiments of the joint processing section 330 include processing that 
reduces the effects of NEXT and FEXT interference, and reduces the effects of echo signals. 
The transformed digital signal streams are jointly processed. That is, the processing of each 
digital signal stream is dependent upon characteristics (interference and echo) of the other digital 
signal streams being processed. More specifically, for an Ethernet system that includes four 
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digital signal streams (four transmitting streams, and four receiving streams) electrically coupled 
to four neighboring twisted pair of a communication link, the joint processing of each 
transformed digital signal stream is dependent upon interference caused by the other digital 
signal streams. 

The joint processing can include a matrix multiplication of each of the digital signal 
streams. The matrix multiplication of the four transformed digital signal streams generates four 
jointly processed outputs. The elements of the matrix are selected to reduce the effects caused 
by interference due to coupling of the digital signal streams during transmission over the 
communications link, and self-interference, such as ISI. The elements can be additionally 
influenced by the effects of echo signals. 

The inverse transform section 440 inverse transforms the joint processed signal streams 
back to the original domain. Generally, the original domain is the time domain. 

Figure 5 shows greater detail of an Ethernet receiver, according to an embodiment of the 
invention. This embodiment includes four digital signal streams (Rl, R2, R3, R4) being received 
by the Ethernet receiver. Figure 5 only shows the joint processing for a single digital signal 
stream (SI). Similar joint signal processing is generally included for each of the other digital 
signal streams (S2, S3, S4). All of the signal streams (Rl, R2, R3, R4) are used to estimate a 
single signal stream. The estimated signal stream of Figure 5 is the first far end signal stream SI. 

The analog front end of Figure 5 has been simplified by only showing four analog to 
digital converters (ADCs) 512, 514, 516, 518. The signals transmitted over the Ethernet twisted 
pairs are analog signals. The analog signals are modulated and include the information of the 
digital signal streams. Effectively, each ADC 512, 514, 516, 518 receives a corresponding 
analog signal, and converts the analog signal into a digital signal. Other front-end functions of 
the Ethernet receiver are not shown in Figure 5 for clarity. 

The digital signal streams (Rl, R2, R3, R4) are transformed to a simpler domain 
(requiring less complex electronic circuitry) by transform blocks 522, 524, 526, 528. 
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Processors 532, 534, 536, 538, and summer 540, jointly process the transformed digital 
signal streams. An embodiment of the joint processors includes the first digital signal stream Rl 
joint processor 532 being a feed forward equalizer, the second digital signal stream R2 joint 
processor 534 being a feed forward far end cross-talk filter, the third digital signal stream R3 
joint processor 536 being a feed forward far end cross-talk filter, and the fourth digital signal 
stream R4 joint processor 538 being a feed forward far end cross-talk filter. The joint processing 
allows for recovery of the transmitted signal stream SI, and reduction of the interference caused 
bySl,S2, S3,S4. 

Each of the joint processors can be implemented as vector multipliers that multiply the 
digital signal streams Rl, R2, R3, R4 with a vector of coefficients. The coefficients of the 
vectors are selected to minimize interference caused during transmission of the signals SI, S2, 
S3, S4 over the communications link. The coefficients of the vectors can additionally be 
influenced to reduce the effects of interfering signals from adjacent Ethernet communications 
links. These interfering signals are sometimes referred to as alien near end cross-talk signals 
(ANEXT). 

The embodiment of Figure 5 includes a summer 550 that sums the jointly processed 
digital signal streams. 

An inverse transform block 550 inversely transforms an output of the summer 550. The 
output of the inverse transform is a first estimate of the first digital signal stream SI. Additional 
time processing can be included to refine the first estimate. 

Figure 6 shows an Ethernet transmitter, according to an embodiment of the invention. 
The transmitter receives digital signal streams ST, S2\ S3', S4' for transmission over an 
Ethernet channel. The joint processing of the invention is used in this embodiment to pre- 
process the digital signal streams before the digital signal streams are transmitted. The pro- 
processing reduces the interference between the digital signal streams both before and after 
transmission over the Ethernet channel. 
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Transform block 620 transforms each of the digital signal streams from an original 
domain into a lower complexity-processing domain. The lower complexity domain allows the 
joint processing of the digital signal streams to be more easily implemented. 

Ethernet joint signal processor 630 joint processes the transformed digital signal streams, 
each joint processed digital signal stream being influenced by other digital signal streams. As 
will be describe later, the joint processing can include matrix multiplication of vectors of digital 
signal stream with a joint processing matrix. The elements of the joint processing matrix can be 
dynamically determined to allow a continuous reduction of transmission interference. 

Inverse transform block 640 inverse transforming the joint processed signal streams back 
to the original domain. 

Front end transceiver 610 generates analog signals from the processed digital signal 
streams for transmission over an Ethernet channel. 

Figure 7 shows greater detail of an Ethernet transmitter that includes near-channel 
processing. The near-channel processing includes NEXT processing and echo processing. The 
processing generally includes filtering the transmission signals Sl\ S2\ S3', S4\ The filtered 
(joint processed) output can be summed with the received signals to reduce the effects of the 
NEXT and echo interference. 

The transmission signals ST, S2\ S3', S4' are passed through a FEC (forward error 
corrector) 710. 

A DFT block 720 performs a discrete Fourier transform on the transmission signals. 

A NEXT block 730 performs near-channel signal processing of the transmission signals. 
The near-channel signal processing provides estimates the NEXT interference of the 
transmission signals. 

An echo block 740 performs echo processing of the transmission signals. The near- 
channel signal processing also provides estimates the echo signal interference of the transmission 
signals. 
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The processes signal streams are summed with a received signal stream Rl A to reduce 
the effects of near channel interference on the received signal stream R1A. Specifically, the 
estimates of the NEXT interference and the echo interference are summed with the received 
signal stream Rl A to minimize the actual NEXT and echo interference. 

An inverse discrete transform (EDFT) 725 transforms the correction signals (NEXT and 
echo) back to the original (generally, time) domain. 

A DAC 735 converts the correction signal from a digital signal to an analog signal, 
allowing the correction signal to be summed with an analog received signal stream R1A. 
Another embodiment includes the correction signal being summed with the received signal 
stream as an analog signal rather than as a digital signal. Another embodiment includes the 
correction signal being summed with the received signal stream before being converted back to 
the original domain. That is, before being transformed back to the original domain by the IDFT 
725. 

An ADC 790 converts the corrected received signal into a digital signal stream for 
additional receiver processing. 

Figure 8 shows greater detail of another Ethernet transmitter that includes far-channel 
processing. This embodiment includes a FEXT processor 830 which generates a far-channel 
correction signal to be summed with a transmit signal SI A before being transmitted. The 
correction signal reduces the effects of FEXT interference by summing a correction signal with 
the transmit signal SI A. 

The FEC 710 and DFT 720 include the same designators as in Figure 7 to show that they 
can be used for both FEXT and NEXT processing. 

The transmit signal SI A is passed through a FIFO 850 and a filter 860. 

The correction signal of the FEXT processor 830 and the transmit signal SI A are 
summed, preprocessing the transmit signal S1A to reduce the effects of FEXT and ISI 
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interference suffered by the transmit signal SI A during transmission through an Ethernet 
channel. 

Figure 8 includes an IDFT 770 and a DAC 715 which convert the preprocessed transmit 
signal back to the original domain, and then into an analog signal for transmission through the 
Ethernet channel. 

Figure 9 shows acts of a method of joint processing a plurality of digital signal streams, 
according to an embodiment of the invention. 

A first act 910 includes transforming each of the digital signal streams from an original 
domain into a lower complexity domain. 

A second act 920 includes joint processing of the transformed digital signal streams, each 
joint processed digital signal stream being influenced by characteristics of other digital signal 
streams. 

A third act 930 includes inverse transforming the joint processed signal streams back to 
the original domain. 

Matrix Joint Signal Processing 

Joint matrix signal processing of the invention includes estimating the transmission 
characteristics of the Ethernet signals of an Ethernet connection. The characteristics include 
interference (NEXT, FEXT, ISI, ANEXT) and echo signals. 

The transmission characteristics can generally be divided into two matrices, a near- 
channel matrix, and a far-channel matrix. Estimates of the near-channel matrix and the far- 
channel matrix are used in the joint processing of the Ethernet signals. The near-channel matrix 
and a far-channel matrix can be estimated by transmitting known digital signal streams, and 
analyzing the resulting response at a desired receiver. 
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Near-Channel Matrix 

Referring to Figure 3, the receivers associated with transceiver A and the joint processor 
301, receive Ethernet digital signal streams R1A, R2A, R3A, R4A when the transmitters 315a, 
335a, 355a, 375a are transmitting, and transmitters 315b, 335b, 355b, 375b are not transmitting. 
A channel matrix ha can be used to approximate the signal streams R1A, R2A, R3A, R4A 
having been transmitted as digital signal streams SI A, S2A, S3 A, S4A. That is, transceiver A 
receives the digital signal streams SI A, S2A, S3 A, S4A after the digital signal streams pass 
through a near-end Ethernet transmission channel ha. The received digital signal streams Rl A, 
R2A, R3A, R4A can be approximated as (neglecting noise and alien cross-talk): 

R1A= hall© S1A + hal2© S2A + hal3© S3A + hal4© S4A 
R2A= ha21© S1A + ha22© S2A + ha23© S3A + ha24© S4A 
R3A= ha31© S1A + ha32© S2A + ha33© S3A + ha34© S4A 
R4A= ha41© SI A + ha42© S2A + ha43© S3 A + ha44© S4A 

where the hij elements of the ha matrix are the impulse responses of the coupling that generates 
the interference of the received digital signal streams R1A, R2A, R3A, R4A, and where the 
symbol © denotes a convolution. 

A vector ra can be used to represent a vector that includes the received digital signal 
streams. More specifically, ra = [Rl A, R2A, R3A, R4A]. A vector sa can be used to represent 
the transmitted digital signal streams. More specifically, sa = [SI A, S2A, S3 A, S4A]. For this 
representation, ra = ha © sa, and 



ha = 
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This matrix equation of ha, provides a model of the echo and NEXT interference signals, 
this matrix can be generated by transmitting known signals, and observing the resulting received 
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signals. The ha estimation can be performed, for example, during power-up of the Ethernet 
transceivers. The estimation of the ha matrix allows the determination of joint processing 
(filters) for reducing effects of echo and NEXT cross-talk. 

The diagonal terms hjj of the ha matrix represent the impulse responses of the echo 
signal coupling. The off-diagonal terms hij (i^j) of the ha matrix represent the impulse 
responses of the NEXT coupling. The transceiver (transceiver A) has information regarding the 
ra and sa vectors, and can use this information to approximate ha. The transceiver can use the 
known information of ra and sa to generate processing (filtering) to approximate ha. An 
estimate of est_ha can be adaptively determined by minimizing the error of: 

est_ha © sa - ra = error. 

The error minimization can also be determined in the transformed domain. That is, the 
above equation could have been minimized before inverse transforming all of the components 
back to the original domain. 

The estimation can be performed adaptively using a least mean square (LMS) or a 
recursive least square (RLS) algorithm. Both during and after convergence of the algorithms, the 
transceiver computes an est Jia, and filters the vector signals sa based upon the est_ha. The 
receiver portions of the transceiver subtract the joint aggregate signal (estjha © sa) from the 
received signals ra. More specifically, the receivers compute a minimization of: 

ra - (estjia © sa) = (ha © sa) - (est_ha © sa). 

As previously stated, during calibration, ra and sa are known quantities. By determining 
an estimate of the near-end channel (est_ha), joint processing can be performed on received and 
transmitted signal streams to minimize signal interference. The joint processing can be 
simplified for high throughput Ethernet transmission channels, by performing the joint 
processing in a less-complex domain. 
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The transmitted digital signal steams sa are transformed (represented by Sa) to the lower 
processing complexity domain. The estimate of the near-end channel matrix est_ha is 
transformed (represented by est_Ha) to the less complex domain. A near-end correction signal 
(Ena) can be determined by multiplying the transformed signal steam Sa with the transformed 
near-end channel matrix est_Ha). The near-end correction signal (Ena) is then transformed 
(ena) back to the original domain. The inverse transformed signal ena can be subtracted from 
received signals to reduce the effects of near-channel interference. 

For typical Ethernet values of P and N, high echo and NEXT interference cancellation 
can be achieved with lower complexity processing. Moreover, if the receiver is also performing 
frequency domain processing of the received signal ra to reduce ISI, FEXT and/or ANEXT, and 
the signals and transformations have related sizes and delays, the inverse transform of a 
transmitter can be combined with the inverse transform of a receiver, allowing more reduction in 
processing complexity. 

Far-Channel Matrix 

Referring to Figure 3, the receivers associated with transceiver A and the joint processor 
301, receive Ethernet digital signal streams Rl A, R2A, R3A, R4A when the transmitters 315b, 
335b, 355b, 375b are transmitting, and transmitters 315a, 335a, 355a, 375a are not transmitting. 
A channel matrix hb can be used to approximate the signal streams R1A, R2A, R3A, R4A 
having been transmitted as digital signal streams SIB, S2B, S3B, S4B. That is, transceiver A 
receives the digital signal streams SIB, S2B, S3B, S4B, after the digital signal streams pass 
through a far-end Ethernet transmission channel hb. The received digital signal streams Rl A, 
R2A, R3A, R4A can be approximated as (neglecting noise and alien cross-talk): 

Rl A= hbl 1© SIB + hbl2© S2B + hbl3© S3B + hbl4© S4B 
R2A= hb21© SIB + hb22© S2B + hb23© S3B + hb24© S4B 
R3A= hb31© SIB + hb32© S2B + hb33© S3B + hb34© S4B 
R4A= hb41© SIB + hb42© S2B + hb43© S3B + hb44© S4B 

where the hy elements of the hb matrix are the impulse responses of the coupling that generates 
the interference of the received digital signal streams Rl A, R2A, R3A, R4A. 



16 



Teranetics, Attorney Docket #1002-1 



A vector ra can be used to represent a vector that includes the received digital signal 
streams. More specifically, ra = [R1A, R2A, R3A, R4A]. A vector sb can be used to represent 
the transmitted digital signal streams. More specifically, sb = [SIB, S2B, S3B, S4B]. For this 
representation, ra = hb © sb, and 
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This matrix equation of hb, provides a representation of the ISI and FEXT interference 
signals. For example, this matrix can be generated by transmitting known signals, and observing 
the resulting received signals. The hb estimation can be performed, for example, during power- 
up of the Ethernet transceivers. The estimation of the hb matrix allows the determination of 
joint processing (filters) for reducing effects of ISI and FEXT cross-talk. 

The diagonal terms hjj of the hb matrix represent the impulse responses of the ISI signal 
coupling. The off-diagonal terms hij (i^j) of the hb matrix represent the impulse responses of 
the FEXT coupling. The transceiver (transceiver A) has information regarding the ra and sb 
vectors, and can use this information to approximate hb. Typically, the transmitted signal sb is 
known (e.g. a training signal) or can be estimated from the demodulation of ra. The transceiver 
can use the known information of ra and sb to generate processing (filtering) to approximate hb. 
An estimate of hb, denoted as estjib can be adaptively determined by minimizing the error of: 

est_hb © sb - ra = error. 

The estimation can be performed adaptively using a least mean square (LMS) or a 
recursive least square (RLS) algorithm. Both during and after convergence of the algorithms, the 
transceiver computes an estjib. The receiver post-processes estjib, and computes a joint 
matrix equalizer. A pseudo-inverse of est_hb (denoted as Inv_est Jib) can be performed to 
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provide a solution for a joint matrix equalizer for est_hb. Application of the joint matrix 
equalizer Inv_est_hb on the received vector signal ra generates an estimate of the signal sb 
transmitted from transceiver B, denoted est_sb. This joint matrix operation jointly equalizes the 
received vector, and reduces the effects of FEXT coupling across the signal streams. More 
specifically, the receivers compute a minimization of: 

sb - (Inv_est_hb © ra) = sb - (Inv_est_hb © est_hb © sb) = sb - est_sb. 

As previously stated, during calibration, ra and sb are known quantities. Determining an 
estimate of the near-end channel (est_hb) and the joint equalizer (Inv_est_hb), allows joint 
processing on received and transmitted signal streams for reduction of signal interference. The 
joint processing can be simplified for high throughput Ethernet transmission channels, by 
performing the joint processing in a less-complex domain. 

The received digital signal steams ra are transformed (represented by Ra) to the less 
complex domain. The estimate of the far-end channel matrix equalizer Inv_est_hb is 
transformed (represented by Inv_est_Hb) to the less complex domain. An estimate of the 
transmitted digital stream (est_Sb) is determined by multiplying the transformed signal steam Ra 
with the transformed near-end channel matrix (Inv_est_Hb). The estimate of the transmitted 
digital stream (est_S.b) is then transformed (est_sb) back to the original domain. This estimate 
can be post-processed in subsequent functions of the receiver (slicing, error correction, 
scrambling, etc.). 

Other joint processing receivers are possible. For example, the joint matrix equalizer 
Inv_est_hb can be computed directly from the known signals ra and sb, without requiring the 
intermediate step of estimating est_hb. 

Additionally, nonlinear matrix equalizers are possible, such as DFE (decision feedback 
equaliers) or other variants of multi-user detection where the partial estimates of est_sb are used 
iteratively. 
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Partial Time domain processing 

In some applications, the transform domain processing can be shared with partial time 
domain processing for more efficient overall processing. For example consider the case of Near- 
channel matrix ha in which the diagonal elements of ha, hajj have much longer coupling impulse 
responses than the off-diagonal elements of ha, haij (j^j). This situation is common for Ethernet 
transceivers, because the Echo (diagonal) is often longer than the NEXT (off-diagonal). If the 
transform domain processing requires a transform block processor where the size of data to 
process must be larger than the longest coupling impulse response, then all the joint domain 
transformation may be done using the block size of length larger the diagonal elements of ha. 
The diagonal impulse responses can be decomposed into two components hajj = hajjD + hajjT, 
in which hajjD is shorter than hajj and has a length similar to haij (j^j). The new impulse 
responses hajjD can be used in the joint transform domain processor and hjjT can be used in the 
time domain processing. That is, the joint domain transformation is performed on the new Near 
channel; 



haD = 



haUD 


hall 


hal3 


haU 


hall 


hallD 


ha!3 


ha!4 


ha3l 
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ha33D 


ha3\ 


haA\ 
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and the remaining processing of hajj is performed in the time (or other) domain. Another 
alternative includes performing a second joint domain transform processing on the remaining 
hajjT. In another embodiment, a plurality of coupling terms haij have shorter coupling impulse 
responses and the processing is performed in the original domain. 

A third matrix he can be estimated for alien signals. However, known signals generally 
cannot be transmitted to estimate the effects of alien signals. Therefore, some type of blind 
estimation techniques must be used. 
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Performance Advantages Offered by Domain Transformation Processing 

Examples can be provided to demonstrate the advantages of domain transformation joint 
processing of Ethernet transmission signals over time domain joint processing of Ethernet 
transmission signal to reduce the effects of transmission interference. The examples provided 
include DFT transformations, however, other domain transformations can be used. 

Typical Ethernet LAN connections suffer from self-interference (ISI, echo interference) 
and/or cross talk interference (NEXT, FEXT) that spans anywhere from 10 samples to 1000 
samples depending on the type of interference. Echo interference and NEXT interference 
typically require longer spans of 100-1000 samples, and FEXT and ISI require shorter spans of 
10-100 samples. Other factors that effect the number of required samples includes the Ethernet 
cable length and cable type (cat5, 6, 7, etc). For simplification, 100 samples is used for the 
following examples. 

Time Domain Processing 

To implement a single FIR with P real valued coefficients, a standard processor must 
perform P real valued multiplies and adds for each desired filtered output sample. This number 
greatly increases for the Ethernet filtering because the Ethernet transceiver must process multiple 
transmit signals and multiple receive signals. Additionally, the Ethernet transceiver mitigates 
coupling of the multiplicity of signals. 

Simpler Domain Processing 

Alternatively, the signals can be transformed into a domain where filtering is simpler, 
such as the DFT domain. The filtered results can be inverse transformed back to the original 
domain. The nominal complexity of a real valued DFT is of order N*log 2 (N), where N is the 
block size of the DFT. The exact complexity depends on the input being real or complex valued 
and implementation details of the DFT, such as FFT size, radix size, memory vs 

20 



Teranetics, Attorney Docket #1002-1 

area/speed/latency tradeoffs. Filtering in the DFT transform domain dictates a point by point 
multiplication of the N samples of DFT processed data with the N DFT samples of the filter. 
When the application permits processing of N samples at a time, to the first order, filtering N real 
samples requires N*log 2 (N) + 2N + N*log 2 (N) operations. Computing a filter of length P using a 
DFT of size N, generates N-P filtered sampled per transformation. The complexity per sample is 
in the order of (2*log 2 (N) + 2)/(l-N/P). Whenever the length of the FIR, P, is significantly larger 
than 2*log 2 (N) there can be significant simplifications in complexity, and therefore, hardware 
costs and power dissipation. For example, if N = 256 > 2*P for the case of P=100, the 
transformed domain requires about (2*log 2 (256)+2)/(l-100/256) = 30 mult/adds, and the 
standard implementations requires 100 multiplies and adds per output sample. Neglecting other 
HW implementation details such as memory, precision, clock rate, etc, this results in a net gain 
of approximately 3 times. 

Matrix Joint Signal Processing 

The hardware savings are much larger for the situation in which multiple coupled desired 
or un desired signals share a common communication channel, for example, Gigabit Ethernet 
over CAT-5/5e/6 having 4 twisted pairs per cable. In this case, each the 4 information-bearing 
signals transmitted over each of the 4 pairs interferes (cross-talks) with the neighbor 3 pairs 
(FEXT), and the 4 transmitted signals interfere with the four received signals (echo/NEXT). In 
this situation, multiple filtering operations must be performed for each of the 16 two pair 
combinations. For example, a first twisted pair interferes with the first twisted pair (echo), the 
first twisted pair interferes with a second twisted pair (NEXT and FEXT), an so forth for all of 
the twisted pairs. Therefore, the system may require many long FIR implementations 
simultaneously. Joint processing of all these signal (for example, matrix filter implementations) 
using the domain transformation and matrix filters results in very large reductions in the 
hardware required to implement the joint processing. 

Consider two signal sources (xl, x2) and two matrix signal outputs (yl, y2) that are a 
function of the two signal sources. For this case, the input-output relationship is 
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yl = hll©xl+hl2©x2 
y2 = h21©xl + h22©x2 

where '©' denotes convolution, and hij is the filter between input T and output 'j\ The hii or 
diagonal terms model the self-interference (typically ISI or echo) and the off diagonal (hi 2 and 
h21) model the coupling or crosstalk interference (typically FEXT and NEXT and alien NEXT). 
Designating the length P of the 4 FIR hij, the direct implementation of this matrix filter requires 
4*P MAC (multiple and accumulate) per output sample vector (yl, y2). The filter 
implementation can be greatly reduced by using the domain transformation with a DFT. FIR 
filtering in the DFT domain can be implemented with point wise products. The system first 
computes the DFT of each of the FIR responses Hij = DFT(hij), and store these values. In steady 
state the operations required are: 

1. Performing two DFTs of size N, more precisely: 

Xl=DFT(xl) 
X2=DFT(x2). 

2. DFT domain filtering, which is performed by point by point multiplication. For 
example 

Y1=H11*X1+H12*X2 
Y2=H21*X1+H22*X2 

3. Finally, the desired outputs are; 

yl=IDFT(Yl) 
y2=IDFT(Y2). 
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Some minor pre/post processing may be required to account for edge effects, such as "overlap 
and add" or "overlap and save". The total steady state complexity for this implementation of the 
DFT domain transformation is two DFT of size N, 4N point wise MAC per sample in the 
transform domain, and two IDFT. For the DFT case the complexity of the direct and inverse 
transformation is the same. In the more general case of M inputs and L outputs, the complexity 
is M+L DFT/IDFTs of size N and 2*M*L*N point wise MAC for the DFT domain filtering 
(where the 2 is for complex arithmetic on real signals). Including the overlap, the number of 
joint filtered vector output samples per matrix block transformation is (N-P). Thus the operation 
complexity per output sample is of the order 

((M+L)*N*log 2 (N) + 2*M*L*N) / (N-P) 

or equivalently 

((M+L)*log 2 (N) + 2*M*L) / (1-P/N) 

For the direct FIR matrix implementation the total operation complexity is M*L*P per filtered 
vector output sample of size L. Lets revisit the case of high data rate Ethernet systems, where 
M=L=4 and we choose P=100, N=256 for a good balance between overlap and latency. The 
operational complexity of the joint domain transformation is 

((4+4)*log 2 (256) + 2*4*4)/(l-100/256) = 157 
and the standard implementation is 
4*4*100= 1600 

The savings in HW complexity, cost and power for this structure is in the order of 10 times. This 
saving can be even larger for Echo/NEXT cancellers, where P can be more than 500 coefficients. 
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The large savings of this structure can be used to increase the performance (throughput, 
reach) of the transceiver. For P=100 the performance of Echo and NEXT cancellation is poor. 
The low complexity of the proposed structure allows for increasing P greatly for better cross-talk 
cancellation, or to increase M or L for better alien cross-talk cancellation. 

Improved alien cross-talk cancellation 

Similarly the joint transform processing can be used for alien cross-talk mitigation. 
Additional ADCs may be included to allow additional signal streams for improved performance 
or improved cancellation capability. For this situation, the number of inputs to the transform 
processor is larger (M>4), but the number of outputs can remain the same (L=4). Three 
additional ADCs provides a total of M=7 input streams. The total operational complexity of this 
joint transform processor is: 

((7+4)*log 2 (256) + 2*4*4)/(l-100/256) = 197. 

A transceiver with additional input streams with joint transform processing has much 
better alien cross-talk processing cancellation than the standard canceller with M=4, and has 8 
times less operation complexity. 

Although specific embodiments of the invention have been described and illustrated, the 
invention is not to be limited to the specific forms or arrangements of parts so described and 
illustrated. The invention is limited only by the appended claims. 
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